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CHARACTERIZATIONS OF REAL LINEAR ALGEBRAS* 

By Anthony P. Cotroneo 
Langley Research Center 

SUMMARY 

Some characterizations and properties of linear algebras (or hypercomplex sys- 
tems) over the field of real numbers are presented with essential definitions and theorems 
concerning real linear algebras on which neither the commutative property nor the asso- 
ciative property of multiplication is defined. Theorems relating the concepts of normed, 
absolute valued, and division algebras are also given. 

In addition, algebras which are commutative or associative with respect to multi- 
plication are considered. A construction of the algebra of real quaternions is given with 
a proof of the classical result of Frobenius which illustrates the unique place of complex 
numbers and quaternions among the algebras. Except for isomorphisms, the real num- 
bers, the complex numbers, and the algebra of real quaternions form the only associative 
division algebras over the field of real numbers. 

The algebra of real quaternions is discussed in detail. It is shown that all automor- 
phisms on this algebra are of a specific type. These automorphisms form a group of 
linear orthogonal transformations which in turn define the group of all rotations on the 
real Euclidean vector space of dimension three. The technique of using quaternions to 
describe rigid-body rotations is especially useful in eliminating the singularities (gimbal 
lock) existing in the Euler angle rate equations. The illustrations presented point out 
many ways in which quaternions can be handled. 

INTRODUCTION 

The study of linear algebras (or hypercomplex systems) began with W. R. Hamilton's 
discovery of quaternions. Hamilton was then primarily interested in the solution of two 
problems: 

(1) Given an n dimensional vector space, is it possible to define multiplication in 
such a way that the resultant system is a field ? 

*The information presented herein was submitted as a thesis in partial fulfillment 
of the requirements for the degree of Master of Arts, the College of William and Mary 
in Virginia, Williamsburg, Va., 1965. 


(2) Can the product of two sums of n squares be expressed as a sum of 
n squares ? 

Hamilton defined a quaternion to be a quadruple of real numbers with the operations 

( a i> “4) + (0i» h> h> h) = (“1 + h> °2 + “3 + ^3’ “4 + h) 

and 

( a V °®2’ “4) • (^1» &2’ % ^4) = ( r V y 2’ r 3’ r 4) 

where 

y \ = “1^1 " <^2 ‘ ^3^3 ' V4 

^2 = “ 1^2 + “ 2^1 + ^ 3^4 - a 4 % 

y 3 = “l^3 " “2^4 + “3^1 + a 4^2 
r 4 = 0^4 + o^g - Og0 2 + a 4 / 3 x 

He showed that all the axioms for a field were satisfied with the exception of the commu- 
tative law of multiplication. He was also able to obtain the striking identity 

( 2 2 2 2\( 0 2 0 2 0 2 0 2 \( 2 2 2 2 \ 

K + °2 +a 3 + a 4 ) • V^l + h + P* ) = { y l +y 2 +y 3 + y 4 ) 

Hamilton's discovery led to a great deal of interest and study in the areas of both 
linear algebra and applied mathematics. In mechanics, for example, quaternions have 
proved to be a useful tool in the representation of rigid-body rotations. Therefore, the 
algebra of real quaternions is treated in detail herein and should be particularly useful 
to individuals interested in the application of quaternions to describe rigid-body rotations. 

The objective of this paper is to present some characterizations and properties of 
both nonassociative and associative linear algebras over the field of real numbers. Also 
included is a construction of the algebra of real quaternions from the system of complex 
numbers. By use of quaternions, one can construct still another but less attractive 
algebra, the eight dimensional Cayley algebra. Because defining this system is very 
involved, the system of Cayley numbers has not been presented in this paper. For a 
discussion of the properties and a proof of the uniqueness of this system, the reader is 
referred to references 1, 2, and 3. 
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Many of the theorems discussed in this report are treated in the various refer- 
ences. Because of the author’s interest in the application of quaternions to rigid-body 
motion, this paper was written to present a logical and formal organization of pertinent 
material dealing with this subject. 

GENERAL CHARACTERIZATIONS 


General characterization theorems of algebras over the field of real numbers are 
presented, with some definitions essential to the presentation. 

Definition: Let A be a vector space of finite dimension n over the field R. 
Then A is a linear algebra of order n (or simply an algebra) if there is defined on A 
a product xy which satisfies the conditions 


x(o:y) = (o!x)y = a(x y) (for a in R and x, y in A) 


x(y + z) = xy + xz 
(y + z)x = yx + zx 


(for x, y, z in A) 


If R is the field of real numbers, A is called a real algebra. Also, if A contains 
an element e such that ex = xe = x for all x in A, A is said to be an algebra 
with identity and this element is denoted by "1.” The definition does not assume commu- 
tativity or associativity of multiplication on the algebra A. 

From this definition is derived the following useful representation of a product 
in A. Let e^, eg, • • •> e n be a basis for A, and let 


x = x^e^ + Xgeg + 


+ x„e„ 
n n 


and 


y = y 1 e 1 + y 2 e 2 + . . . + y n e. 


be any two elements in A. Then 


n 


xy= £ Xjyjejej 
i,j=l 
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is an element in A. Therefore, 


n 

e i e j = J, njk e k (*> J ■ x > 2 n ; njk to R ) 

k=l 

so that 

n 

- X Vk 

k=l 

n 

z k = £ Wijk (k = 1, 2, . . n) 

i,j=l 

Q 

Thus, multiplication of elements in A is completely determined by n constants 
These constants are called the structure constants of the system. Throughout this paper, 
R denotes the field of real numbers and it is understood that A is of finite dimen- 
sion n. Also, to avoid confusion, in some instances a dot is used to indicate vector 
multiplication. 

Definition: Let A be a real algebra; A is said to be absolute valued if there is 
a function <p on A to R such that 

m = o 

c p(x ) >0 (if x / 0) 

0(xy) = 0(x) <p( y) 

<M X + y) = 0(x) + 0(y) 


and 


cp(ax) = | Q! 1 0 (x) 

for all x, y in A and a in R. If these properties hold, <p is designated an abso- 
lute value function on A. If <p(x y) S <f>(x)<£(y), A is said to be a normed algebra and 
<p is called a norm function on A. 
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Multiplication 


Theorem 1: Every real algebra is a normed algebra. 

Proof: Let A be areal algebra having a basis e^, e 2 , • . e n< 
on A is defined by 


e i e j = I njt'k 

k 


where the 



are real. 


Now, let 


— o?e* (i — 1, 2, • • 

for any nonzero real number cl Then u^, Ug, • • •> u n forms a new basis for 
that 


uu. = 
i 3 


6 ijk u k 


and where 6^ = ofy^* Let 


x = 


2 


x i u i 


and 


y= i y j u j 

3 

be any two elements in A; then 

*y= 

k 

where 

z k - 1 x i y j 6 ijk 


, n) 

i. such 
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Now choose a such that 


so that 



(i, j, k = 1, 2, . . ., n) 


N s s I h y i 
y 


Now define 


4>( w) = |w| = |Wj| + |w 2 | + . . . + |w n 


for every 


w = ^Wi u i 

i 

in A. Since 

0(xy) = jxy| = £ |z k | S ^ | Xi | | Yj | = |xj jy| 
k i,j 


and 


cp(x + y) = |x + y| = ^ | x i + Yi| = | x | + jy| 
i 

c p is a norm function on A and clearly satisfies the remaining properties which must 
hold for a norm function. Hence A is a normed algebra and the proof is complete. 
This theorem is treated in reference 1. 

Definition: A is a division algebra if the equations ax = b and ya = b always 
possess solutions for a 4- 0. 

Theorem 2: Every real absolute valued algebra is a division algebra. 
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Proof: Let A be a real algebra with absolute value function 0. For some a 
in A, define the mappings 


xR„ = xa 

ci 


(or R a : x — xa) and 

xL a = ax 

(or L a : x — ax) for all x in A. Then R a and L a are linear transformations 
on A. If a ^ 0 and x ^ 0 , then 0(a) > 0 and 0(x) > 0 implies that 0(xa) > 0 and 
hence xa ^ 0 . Therefore, if a ^ 0 , the null space of Rg^ consists of the zero vector 
alone and similarly the null space of L a consists of the zero vector alone. Hence, if 
a 4 0 , R a and L a are nonsingular linear transformations on A. Thus, it follows 
that the equations xa = b and ax = b can be solved when a ^ 0 . This theorem is 
discussed in references 1 and 4. 

Definition: Let A be an algebra and let P and Q be nonsingular linear trans- 
formations on A. The algebra A* whose elements are those of A but whose product 
operation is defined by x * y = xP • yQ is called an isotope of A. Therefore, A 
and A* are said to be isotopic. 

Theorem 3: If A is a real absolute valued algebra, then A has an absolute 
valued isotope with identity. Furthermore, the absolute value function of A is pre- 
served in its isotope. 

Proof: Let 0 be an absolute value function defined on A. Since <p(ax) = | a | 0(x) 

for every real a and x in A, there exists a nonzero element e in A such that 
0(e) = 1. As before, since e / 0 and A is absolute valued, xR £ = xe and xL £ = ex 
define nonsingular linear transformations on A. Let x, z be any elements in A such 
that 


xR„ 


-1 


= z 


Then 


x = zR g = ze 


and 


0(x) = 0(zR e ) = 0(z) = 0 (xR £ _1 ) 
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Similarly, 

0(x) = 0(xL e _1 ) 

Now define an isotope A* of A by 

x *y = xR e -1 • yL e -1 

Since A and A* are the same linear spaces over R, the properties of (f> involving 
addition and scalar multiplication are preserved on A*. Also, 

<p{x*y) = 4>(xR e -1 ) • <^(yL e _1 ) = <t>{x)4>(y) 

for all x, y in A*. Therefore, A* is absolute valued and preserves the absolute 
value function of A. 

O 3 |( 

Finally, consider the product e * y in A . The product of two linear transfor- 
mations R and L is defined by 


x(R • L) = (xR) • L 


for all x in A. Thus, 

e 2 *y = e 2 R £ • yL^ 1 = e(yL e _1 ) = (yLg -1 )^ = y 

for all y in A. In a similar fashion 

x * e 2 = x 

for all x in A. Hence, e 2 is the identity of A*. Discussion of this theorem is 
given in reference 1. 

Definition: Let A be a real algebra with the basis e^, eg, • . e n . Denote the 
vector scalar product of 


x = Xjej + x 2 e 2 + 


+ xe„ 
n n 


and 


y = y^ + y 2 e 2 + . . . + y n e n 
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by <x, y > . The norm of the vector x is defined by 

<x, x> = N(x) = Xl 2 + Xg 2 + . . . + x n 2 

In the theorems which follow, use is made of the fact that the scalar product defines a 
nondegenerate and symmetric bilinear form on A — that is, 


<x, A> = 0 implies x = 0 


<x, y> = <y, x> for all x, y in A 


<x, ay + /3z> = a<x, y> + /3<y, z> 
<ax + jSy, z> = a<x, z> + /3<y, z> 


for all real a, ft and all x, y, z in A 


If A is not associative, then x n is not uniquely defined in A. Therefore, in 
order to give meaning to x n , define 


x n = x n - 1 -X 


(n - 2, 3, . . .) 


Lemma 1: Let A be a real algebra such that N(xy) = N(x)N(y) for all x, y 
in A. If x n = x n "l • x, where n = 2, 3, . . then n(x^) = N(x)^ for all positive 
integers k. 

Proof by induction: Let x be any element in A. The lemma is obviously true 
for k = 1. Now let k be an arbitrary positive integer such that 

n(x^) = N(x)k 


Then, 


N(x k+1 ) = N(x k • x) = N(x k ) N(x) = N(x) k N(x) = N(x) k+1 


Hence, 


N(x k ) = N(x) k 


for all positive integers k. 
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Theorem 4: Let A be a real algebra with identity. If N(xy) = N(x) N(y) for all 
x, y in A, then A is absolute valued and the absolute value function defined on A 
is unique. 

Proof: Define 


0(x) = ||x|| = + /N(x) 


for all x in A. Then 


0(x + y) = |x + y|| = + y<x + y, X + y> 


so that 


ll x + y II 2 = || x ll 2 + 2 l< x > y>| + IMI 2 

By use of the Cauchy -Schwarz inequality, 

! x + y | 2 = ll x l | 2 + 2 ll x ll IMI + Ikll 2 = (ll x ll + II y II ) 2 

and, hence, 

0(x + y) = || x + y || = | x | + | y | = 0(x) + 0( y) 

The remaining properties which must hold for cp to be an absolute value function 
are clearly satisfied and thus A is absolute valued. 

To avoid confusion between the identity in A and the identity in R, the identity 
in A is denoted by e. Let e, eg, e^, . . e R form a basis for A. If 0 is an abso- 
lute value function on A, then 


0(0 = 0(ee) = 0(e)0(e) 


and, hence, 


0(e) = 1 

Now suppose 0(x) is not unique on A. Then there exists an absolute value func- 
tion 0(x) on A such that 0(a) ^ 0(a) for some a / 0 in A. Therefore, either 
0(a) > 0(a) or 0(a) < 0(a). 
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Consider first 0(a) > 0(a). If 



then 


||y || = + >/NCy) = 1 

Let 

y k = yi e + y2 e 2 + • • • + y n e n 

Also 

6(y)>l since 6<y) “ f^[ > jprfl ' 1 
Furthermore, by lemma 1, 

l = N(y k ) = y x 2 + y 2 2 + . . . + y n 2 
which implies j y t | £ 1. Since 0 is an absolute value function, 

0(y) k = fl(y k ) ^ |y x | 0(e) + |y 2 | «(e 2 ) + • • • + |y n | 0 ( e n) 

and, hence, 


0(y) k g 1 + 0(e 2 )+ • . • + 0(e n ) 

But this is impossible since 0(y) >1 and k is arbitrary. 

Now consider 0(a) < 0(a). Since A is absolute valued, it is a division algebra. 
Therefore, the equation az = e can be solved whenever a ^ 0. Let y = ||a||z; then 

0 (y) = || a ||0(z) 


But since 


0(a) 0(z) = 0(e) = 1 
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then 


S(y, = M>M = 1 

0(a) ^(a) 


and 


0(y) = i 

The rest of the proof for 0(a) < 0(a) proceeds exactly as that for 0(a) > 0(a) - that is, 
a contradiction is arrived at to the assumption that 0(a) < 0(a). Hence 0(x) = 0(x) for 
all x in A, so that <£(x) is unique. 

Lemma 2: Let A be a real algebra such that N(xy) = N(x)N(y) for all x, y 
in A; then for all x, y and x', y* in A 

(1) <xy, x'y> = <x, x’>N(y) 

and 

<xy, xy*> = N(x)<y, y’> 

(2) <xy, x'y’> + <xy’, x’y> = 2<x, x’><y, y’> 

Proof: It can easily be established that for all x, y in A 

<x, y> = |[n(x + y) - N(x) - N(y)j 


Then 


<xy, x'y> = |[N(xy + x’y) - N(xy) - N(x'yj] = <x, x’>N(y) 


Similarly, 


<xy, xy’> = N(x)<y, y*> 


Therefore, 


<x(y + y'), x’(y + y’)> = <x, x'>N(y + y') 


However, 


N(y + y’) = 2<y, y’> + N(y) + N(y’) 
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Hence 


<x(y + y’), x’(y + y')> = 2<x, x’><y, y’> + <x, x’>N(y) + <x, x’>N(y') 

= 2<x, x’><y, y’> + <xy, x'y> + <xy’, x’y’> 


Now since 

<x(y + y’), x’(y + y’)> = <xy, x’y> + <xy, x’y’> + <xy', x’y> + <xy% x’y’> 


then 


<xy, x'y’> + <xy', x’y> = 2<x, x’><y, y’> 

This lemma is treated in references 4 and 5 and is used in the proof of the next 
characterization theorem. 

Definition: Let A and B be algebras over a field F. A one-to-one mapping 
i// of A onto B is called an isomorphism of A onto B if the operations of addition 
and multiplication are preserved under the mapping - that is, 

rj/(a a + /3b) = axp( a) + jSi/^b) 

i^(ab) = \f/( a) ^(b) 

for all a, b in A and a, 0 in F. Algebras A and B are said to be isomorphic 
if there exists an isomorphism of A onto B. By an automorphism of an algebra A is 
meant an isomorphism of A onto itself. If a mapping 4/ is an isomorphism (or auto- 
morphism) except that 

^(ab) = 4/(b) *Ma) 

4/ is said to be an anti-isomorphism (or anti-automorphism). 

Definition: An algebra A is termed an alternative algebra if for every x, y 
in A, x 2 y = x(xy) and xy 2 = (xy)y. 

Theorem 5: Let A be areal algebra with identity 1. If N(xy) = N(x)N(y) for 
all x, y in A, then A is an alternative algebra with involution (anti -automorphism) 
if/ : x — x such that 


xx = N(x)*l 
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and 


x + x = T(x)-1 (T(x) is real) 

Proof: Let I denote the subspace of A spanned by the identity and let I"* - 
denote its orthogonal complement. Then A is a direct sum of I and I* L and is 
written A = I © - that is, every x in A can be written as 0!*1 + a for some 

real a and a in tK For a proof of this the reader is referred to reference 6, 
page 157. 

For x = a-1 + a in A, define x = a* 1 - a and consider the mapping 4 / : x — x 
given by \js(x) = x. Clearly 


x(/(ax + /3y) = ai(/(x) + p\f/( y) 


It is now shown that 


^(xy) = >My) i^(x) 

If x' = 1 and x is taken in I"*", then by (2) of lemma 2 

<xy, y’> + <xy', y> = 2<x, l><y, y’> = 0 
for all y, y' in A. Also, by the law of multiplication defined on A, 

<(a.l)y, y’> - <y, (o'.ljy’) = 0 

for all real a. Thus, 

<xy, y’> + <(a-l)y, y’> + <xy', y> - <(o!-l)y', y> = 0 

and 


<(x + Q!-l)y, y’> = <(a-l - x)y', y> 

for all real a, x in I**" and y, y’ in A. Now, if w = onI + x, then 

<wy, y’> = <y, wy'> for all w, y, y' in A. Similarly, if y' = 1 and y is in I - ^ 

in (2) of lemma 2, <xz, x’> = <x, x'z > for all x, x', z in A. Combining these results 

gives, for all x, y, z in A, 
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If x = 1, then <y, z > = <z, y>. Therefore 


<xy, z> = <z, xy> 


Thus, 


<z, yx > - <z, xy>= 0 


which implies that yx = xy and 

^(xy) = xy = y x = y) i//(x) 

Now suppose that ^(x) = i//( y) for some x, y in A. Then, since i//(x) = x, 

x = iK<Mx)) = i^(^(y)) = y 

Since i// is an onto mapping, it defines an involution on A. 

The subspace I contains all those elements of A left fixed by the involution 
and I 1 contains all x in A such that i//(x) = -x. Since i//(xx) = xx, then xx is 
in I, so there exists a real number a such that xx = a«l. Now, 

a= Q!<1, 1> = <a-l, 1> = <xx, 1> = <(l*x), x> = N(x) 

Hence xx = N(x)-l for all x in A. Also, 


xx = N(x)*l = N(x)-1 


Finally, since x + x is left fixed by i//, there is a real number /3 such that x + x = /3*1. 
Now x + x = T(x)*l, where T(x) is a linear functional and is defined to be the trace 
of x. 

To complete the proof, it remains to show that the alternative law is satisfied for 
all elements in A. From the first part of this proof <xy, xz> = <y, x(xz)> for all 
x, y, z in A. But <xy, xz> = N(x)<y, z > from (1) of lemma 2. Hence, 


<y, x(xz)> - <y, N(x)-z> = <y, x(xz) - (xx)z> = 0 



This implies that x(xz) = (xx)z. Since there exists some real number /3 such that 
x + x = /3«1, then 


x(xz) = (fi-1 - x)(xz) = (/3-l)(xz) - x(xz) 


and 


(xx)z = [x(/3-l - xj]z = (/3*l)(xz) - x^z 


Hence 


x(xz) = x^z 

for all x, z in A. If <xz, yz> is considered, it can similarly be shown that 

xz^ = (xz)z 

for all x, z in A. Thus A is alternative and the proof is complete. This theorem is 
discussed in references 4 and 5. 

A direct consequence of this theorem is the following corollary. 

Corollary: Let A be a real algebra with identity 1 and let I denote the sub- 
space spanned by the identity of A. If N(xy) = N(x)N(y) for all x, y in A, then 
every element of A satisfies the quadratic equation x^ - T(x)*x + N(x)-1 = 0 over I. 
Furthermore, the space I is the set of all elements left fixed by the involution 
\p(x) = x, whereas I - *" is the set of all a in A such that i//(a) = a = -a. 

The proof of the converse to this theorem depends on the validity of the Moufang 
identity on an alternative algebra — that is, 

(xy)(zx) = x [(yz)xj 

for all x, y, z in A. In view of this, lemma 3 is presented and is of value in proving 
the converse to this theorem. Denote the set of all n-tuples of elements in A by A n 
and make the following definitions: 

Definition: The associator of an algebra A is a function S defined on A^ 
to A by 


for all x, y, z in A. 


S(x, y, z) = (xy)z - x(yz) 
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Definition: Let A be an arbitrary algebra and let f ^Xp Xg, . . ., x^ be a 
multilinear function defined on A n to A. The function f is said to be skew- 
symmetric provided (1) f takes on the value 0 whenever at least two of its arguments 
are equal, and (2) f changes sign whenever two of its arguments are interchanged. 

Lemma 3: Let A be an alternative algebra over a field F and define the func- 
tion K from A^ to A by 


K(w, x, y, z) = S(wx, y, z) - xS(w, y, z) - S(x, y, z)-w 

for all w, x, y, z in A; then S and K are linear skew -symmetric functions. 

Proof: The proof is contained in two parts. 

I. That S is linear in x is readily verified by expanding S^q;x^ + ^X£> y> z ) 
for any Xp Xg, y, z in A and a, (i in F. Similarly S is linear in y and z. It 
is also clear that S(x, x, y) = 0 = S(x, y, y) when A is an alternative algebra. There- 
fore, 


S(x, y + z, y + z) = S(x, y, z) + S(x, z, y) = 0 


and 


S(y + z, y + z, x) = S(z, y, x) + S(y, z, x) = 0 


Thus 


S(x, y, z) = -S(x, z, y) and S(z, y, x) = -S(y, z, x) 


Finally, 


S(x, y, z) = -S(x, z, y) = S(z, x, y) = -S(z, y, x) 

Hence S is a linear skew -symmetric function from A** to A. 

n. Now consider the function K. It is immediate that K is linear from the 
linearity of S. Also, from the definition of K, note that K(w, x, y, y) = 0. Therefore, 


K(w, x, y, z) = -K(w, x, z, y) 
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since 


K(w, x, y + z, y + z) = K(w, x, y, z) + K(w, x, z, y) = 0 
A function G on is now defined by 

G(w, x, y, z) = S(wx, y, z) - S(w, xy, z) + S(w, x, yz) - wS(x, y, z) - S(w, x, y)*z 
By expanding all the associators it is found that G(w, x, y, z) = 0 and, therefore, 

-K(z, w, x, y) = G(w, x, y, z) - K(z, w, x, y) 

Expanding G and K in terms of their associators and applying part I of this lemma 
gives 

-K(z, w, x, y) = S(wx, y, z) - S(xy, z, w) + S(yz, w, x) - S(zw, x, y) 

By use of 

S(wx, y, z) = K(w, x, y, z) + xS(w, y, z) + S(x, y, z)-w 

it is found that a cyclic permutation of the elements z, w, x, y changes the sign on the 
right-hand side of the expression for -K(z, w, x, y) - that is, 

K(y, z, w, x) = -K(z, w, x, y) 

Thus, for all w, x, y, z in A, 

K(w, x, y, z) = -K(w, x, z, y) 

and 


K(w, x, y, z) = -K(z, w, x, y) 


Since these two permutations of the elements w, x, y, z generate the entire symmetric 
group of permutations, the skew-symmetry of K has been proved. This lemma is 
treated in references 3 and 7. 

Lemma 3 is now used to prove the converse to theorem 5. 
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Converse: Let A be a real algebra with identity 1. If A is an alternative 
algebra with involution if/ : x — x, where xx = N(x)-l and x + x = T(x)-l (T(x) is a 
real number), then N(xy) = N(x)N(y) for all x, y in A. 

Proof: First prove the validity of the Moufang identity on A. It can be easily 

verified that 


(xy)(zx) = x[y(zx)] + S(x, y, zx) 

From lemma 3, 


S(x, y, zx) = -S(zx, y, x) = -S(x, y, x)-z - xS(z, y, x) - K(z, x, y, x) 
and, therefore, 


S(x, y, zx) = xS(y, z, x) = x j]yz)x] - x [y(zx)] 


Hence 


(xy)(zx) = x [(yz)x] 

for all x, y, z in A. Since x + x = T(x)*l for all x in A, then 

x 2 y + (xx)y = [x(T(x)-l)]y 

and 


x(xy) + x(xy) = x [(T(x)-l)y] 

for all x, y in A. By the law of multiplication defined on A, 

[x(T(x)-l)]y = T(x)-(xy) = x [(T(x)-l)y] 
Therefore, since A is an alternative algebra, 

(xx)y = x(xy) 


Similarly, 


x(yy) = (xy)y 
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For every x, y in A, 


N(xy)-1 = (xy)(yx) = (xy) [y(T(x)-l - xj] 
= T(x)-(xy)y - (xy)(yx) 


By the Moufang identity 


N(xy)*l = T(x) [x(yyj] - x [(yy)x] 
= T(x)N(y)>x - N(y)*x 2 


and, hence, 


N(xy)»l = N(y)-x[j(x)-l - x] = N(y).(xx) = N(y)N(x)-l 
The converse to theorem 5 is discussed in references 4 and 5. 

COMMUTATIVE AND ASSOCIATIVE ALGEBRAS 

In the previous section the assumption of commutativity and associativity of multi- 
plication concerning real linear algebras was ignored. In this section further character- 
izations and uniqueness of real linear algebras having these properties are presented. 

Definition: A skew field is a ring in which the nonzero elements form a group 
under multiplication. A commutative skew field is called a field. 

Theorem 6: Let A be a real division algebra. If multiplication on A is asso- 
ciative, then A is a skew field. (If, in addition, A is commutative with respect to 
multiplication, then A is a field.) 

Proof: If A is associative with respect to multiplication, then A is a ring. 

Let a, b be nonzero elements of A. Since A is a division algebra, there is an ele- 
ment x in A such that ax = b. Similarly, there is an element y in A such that 
by = x. Hence a(by) = ax = b. Since b / 0 and a(by) = (ab)y, it follows that ab ^ 0* 
Thus A has no nonzero divisors of zero. It can now be shown that A has an identity. 

Let a be any nonzero element in A. There exists an element e in A such 
that ae = a. Then e / 0. Now ae 2 = ae, which implies that e 2 = e since a is not a 
divisor of zero. Let x be any element in A. Then 

(x - xe)e =0 
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and 


e(x - ex) = 0 

Hence xe = ex = x, so that e is the identity element of A. As before, this element is 
denoted by 1. 

It is now shown that every nonzero element of A has a multiplicative inverse. 
Let a be any nonzero element of A. There exists an x in A such that ax = 1. 
Then x / 0. Furthermore, 


(xa - l)x = x(ax) - x = 0 

Therefore, xa - 1 = 0 or xa = 1 and, hence, x is the inverse of a. It has been 
shown that the nonzero elements of A form a multiplicative group. Hence A is a 
skew field. Furthermore, if A is commutative with respect to multiplication, then A 
is a commutative skew field or simply a field. This theorem is treated in reference 8, 
pages 18-21. 

It shall now be proved that except for isomorphisms, the real and complex numbers 
form the only commutative division algebras over the real numbers. As before, the 
space spanned by the identity of A is denoted by I. Since A is real, I is clearly 
isomorphic to the field of real numbers. 

Definition: An element a in A is said to be in the center of A if ax = xa 
for all x in A. 

Definition: Let A be a division algebra with identity 1 over a field F. A is 
said to be algebraic over a field K if (1) K is contained in the center of A, and 
(2) every element a in A satisfies a nontrivial polynomial with coefficients in K. 

In this and following theorems the center of A is denoted by C(A). 

Lemma 4: If A is a real associative division algebra, then A is algebraic 
over I. Furthermore, each element of A satisfies a nontrivial linear or quadratic 
equation over I. 

Proof: Since A is an associative division algebra, A has an identity; since A 
is real, I is isomorphic to the field of real numbers. Also, if a is any real number, 
then by the rule of multiplication defined on A, (a-l)a = a(a»l) for all a in A. 

Thus I is contained in C(A). 

Since A is associative, the product of k factors a can be expressed by a^. 

If A is of order n, the set of n + 1 elements (that is, 1, a, a2, . . ., a 11 ) are linearly 
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dependent with respect to R. Hence there exist real numbers otp . . ., a n , not all 
zero, such that 


2 n 

q!q * 1 + a + Qfga + . . . + « n a = 0 

Therefore a is a root of an equation of degree ^ n with coefficients in I. If 

p(x) = ( 2 q *1 + a^x + . . . + ff n x n 

and since I is isomorphic to R, by the fundamental theorem of algebra 

p(x) = f 1 (x)-f 2 (x) . . . f k (x) 

kin, and fj(x) is of degree 1 or 2. Since p(a) = 0, some f^a) = 0 and thus a is 
a root of a linear or quadratic equation over I. This lemma is discussed in reference 9, 
page 10. 

Lemma 5: Let A be an associative division algebra over the field C of com- 
plex numbers. If A is algebraic over C* = CM, then A = C*. 

Proof: Since A is algebraic over C* and if a is any element of A, there 
exist complex numbers Cq, Cp Cg, . . ., c n , not all zero, such that 

Cq* 1 + c^a + . . . + c fi a n = 0 

Again, by making use of the fundamental theorem of algebra, the polynomial 

p(x) = Cq- 1 + CjX + . . . + c n x n 
can be factored into a product of linear factors - that is, 

p(x) = ^x - Xpl^x - X 2 *l^x - Xg*l^ . . . ^x - X n -1^ 
where Xp X 2 , . . ., X n are in C. Now since p(a) = 0, some 

a - Xj'l = 0 

Hence a is in C* and it has been shown that A 9 C*. Since A is algebraic 
over C*, C* 9 A. Therefore, A = C*. This lemma is treated in reference 6, 
pages 326-327. 
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Theorem 7: Let A be areal associative division algebra. If A is commuta- 
tive, then A is isomorphic to either the field of real numbers or the field of complex 
numbers. 

Proof: By lemma 4, A is algebraic over I and hence I, which is isomorphic 
to R, is contained in C(A). Suppose that I / A; then there exists an a in A which 
is not in I. Therefore, a satisfies some quadratic equation with real coefficients. 
Otherwise, a would be in I. Let 

p(x) = x 2 + 2 ax + Q!q» 1 

such that p(a) = 0 and where a and are real. Then 

(a + a- l) 2 = cn^.l - 0 !q *1 

For any x in A and y' in R, if x^ = y’*l and y' > 0, there is a real number y 
such that 

x 2 = y2.j 


Then 


x 2 - y 2 .l = (x + yl)(x - yl) = 0 

which implies that x = ±yl. Hence a? - Q!q < 0 for if this were positive, there would 
exist a y in R such that 


a + ci • 1 = ±y-l 

However, this implies that a is in I. Hence, there is a real number 0 such that 
a 2 - ct q = -/3 2 . Therefore 


(a + o! • l) 2 = -/3 2 * 1 


Thus, if a is in A but not in I, real numbers a, can be found such that 



Set 


o 

so that i = -1 and, hence, A contains I + Li which is isomorphic to the field of 
complex numbers. This field is denoted by C*. It remains only to show that A = C*. 

Since A is algebraic over I, then A is algebraic over C*. If a in A satis- 
fies a polynomial with coefficients in I, then a clearly satisfies a polynomial with 
coefficients in C*. Also, C* 5 C(A) since A is commutative. Therefore, by 
lemma 5, C* = A and the proof is complete. This theorem is treated in reference 6, 
page 327. 

The property of commutativity on A is now dropped and the characterization of 
real division algebras which are associative is continued. 

Theorem 8: Let A be an associative division algebra. For some a contained 
in A, let R a and L a be the linear transformations on A such that xR a = xa and 
xL a = ax for all x in A. Then A is isomorphic to 

A R = { R x I x A } 

and anti-isomorphic to 

A L = { L x | x in A } 

Proof: Define i//(x) = 1^ for all x in A and show that ^ defines an isomor- 
phism of A onto Ar . First consider 

Max + 0y) = R ax+ ^ 

for x, y in A and a, /3 in R. Note that for any a in A 

aR ax + , 3y = ^ ax + & ) = a ( aR x) + ^( aR y) 


Hence 


R . — a R + /3R 
ax+i 3y x H y 
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so that 


xp{ax + 0y) = a\p{x) + /3^(y) 

Now consider xp{xy) = R v „. For a in A, 

aR^y = a(xy) = (ax)y = (aR^Ry = a^-Ry) 


Hence 


so that 


^xy ” 


i//(xy) = \p(x) xp(y) 


Finally, suppose xp(x) = xp(y) and let a be any nonzero element in A. Then 
aR^ = aRy so that a(x - y) = 0 and, since A is an associative division algebra, 
x = y. Therefore, since xp is an onto mapping, A is isomorphic to a r- 

Now consider the mapping xp' (x) = L x . Note that for a in A, 
aL^ = (xy)a = x(ya) = (aL y ^L x = a^IyL^ 


Hence 


\p'(xy) = xp' (y) xp'(x) 

so that i p' defines an anti-isomorphism from A onto A^. 

Thus, if A is an associative division algebra with basis ej, eg, . . e n and if 

x = x iei + x 2 e 2 + . . . + x n e n 


is any element in A, then 


x l R e 1 + x 2 R e 2 + • • • + x n R e n 
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under the mapping xfy and 


x — X-.L. + x 0 L. + . . . + x„L 

1 ej iJ eg n e n 


under the mapping i//' . Therefore, R , R , . . R 

e l e 2 


form a basis for A R and 
e n n 


L e _, . . L 


pages 240-241. 


form a basis for A-,. This theorem is treated in reference 10. 
e n 


Theorem 9: Let A be a real associative division algebra; then the algebras A^ 
and A-^ of linear transformations on A are isomorphic to algebras of real n x n 
matrices. 

Proof: Let 


x = x iei + x 2 e 2 + . . . + x n e n 

where x^ is real and e^, e 2 , . . ., e n form a basis for A. Now xR a = xa is in A 
so that 


and 


Now 


xR a = xa = y = + y 2 e 2 + . . . + y n e n 



e.R = ) a. .e* 

j 

y - - £ *i £ “if j = ^ “if iVj 


and, hence, 


h ■ I a ifi 


with the matrix denoted by m^R & j. Thus, the linear transformation R & which 
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I 


sends the vector x having components ^Xj, Xg, . . x R ^ into the vector y having 
components ^y^, y^, . . ., y n ^ can be represented by the real n x n matrix where 

e i B a - 1 “ij e r 
j 

The mapping 0^R X ^ = m^R x ^ for all x in A is now shown to define an isomor- 
phism of A R onto M(A r ) = |m^R x ^ j R x in A R j.. 

First, note that for real a, /3 and R x , Ry, in A R 

e(«R x + /3R y ) = m ^<sR x + /3R y ) = + j3mjR y j 

since 


e i ( aR x + ^ R y) = a ( e i R x) + ^( e i R y) 

Suppose 

C i R x - 1 “Vi 

j 

and 

e j R y = X 

k 

then 

e i( R x R y) = Yj “ij ( 6 i R y) = E “ijJ % e k 

j J k 

Therefore, 

e i( R x R y) = X 

j, k 

which implies that 

m N = ("ii)(v) =m N’ B (M 
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that is, 


“(VS-) = ( T ik) =1 Vi* 

j 

Hence 

*(W) = m(R x )m(R y ) 

Now suppose fl^R^ = y ^; then m^R x - R y ^ = (0). If 

m ( R x ' R y) = ( a ij) 

each = 0 so that 

- R y) = e i< x - ^ * 0 

which implies that x = y so that R x = R y . Hence the mapping 6 is an isomorphism 
of A r onto M ( Ar )‘ 

Similarly, if 

M ( A L) = { m ( L x)| L x in A l} 
it can be shown that the mapping 6' defined by 

9 '( L x) = m ( L k) 

for all x in A is an isomorphism of A^ onto M^A-^. This subject is discussed 
in reference 10, pages 202-213. 

Corollary: Let A be a real associative division algebra; then A is isomorphic 
to the algebra M^A R ^ and anti -isomorphic to the algebra M^A-^. 

Definition: The isomorphism A = M(A R ) is known as the first regular repre- 
sentation of A and the anti -isomorphism A = M^A L j is called the second regular 
representation of A. 

Hence, given an associative division algebra A with basis e R eg, . . e n , 
there is for any 


x = Xje^ + Xgeg + 


+ x n e 
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in A the following correspondence: 


x-, R + x 0 R„ + . . . + x R„ 

1 e l 2 e 2 n e n 


and 


x 


X 1 



+ x 2 



+ 


' + x n m 



Similarly 


x _L x — m (L x ) 

An example of a regular representation of the algebra of complex numbers is as 
follows: Let C denote the algebra of complex numbers. Then 1, i is a basis for C, 
so that for any a + 0i in C 


0! + /3i aRj + /3Rj a m^Rj^ + 0m^R i ) 
where m^R^ is given by 

lRj = 1 + Oi 
iRj = 0 + i 

and m^R^ is given by 

lR i = 0 + i 
iRj = -1 + Oi 

Therefore, 



Since C is a commutative algebra, the first and second regular representations of C 
are identical. 
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The algebra of real quaternions is constructed by imitating the construction of the 
complex numbers. Again, let C denote the algebra of complex numbers. 

Consider the set Q of all ordered pairs (a, b) where a, b are complex numbers. 
For all complex numbers a, b, c, d the operation of addition on Q is defined by 
(a, b) + (c, d) = (a + c, b + d). Scalar multiplication is given by c(a, b) = (ca, cb). Under 
the given operations, Q is a vector space over the field of complex numbers. Each 
(a, b) in Q can be expressed as 

(a, b) = a(l, 0) + b(0, 1) 

Multiplication on Q is defined as follows: 

(a, b)(c, d) = (ac - db, da + be) 

where the bar indicates the complex conjugate. The multiplicative identity is clearly 
1 = (1, 0). If j = (0, 1), then j2 = -1. Now, every (a, b) in Q is uniquely expressible 
in the form 


(a, b) = a- 1 + bj 

and the rule of multiplication on Q can be written as 

(a-1 + bj)(c>l + dj) = (ac - db) *1 + (da + bc)j 
Let a = <Iq + and b = where ckq, a j, o^, are real; then, 

(a, b) = a.l + bj = 0 !q -1 + + a^j + 

Let (A, o) = i and (o, /^l) = k. Thus, each element (a, b) in Q is uniquely repre- 
sented in the form 


(a, b) = (Xq-I + a^i + Qfgl + a 3 k 

By this rule of multiplication is computed the following table which also defines multi- 
plication on Q: 

i 2 = j 2 = k 2 = 
ij = -ji = k 
jk = -ki = i 
ki = -ik = j 
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Under this rule of multiplication, 

Q = | a 0 *l + eeji + o-gj + ff 3 k | a^, a 2 , Qfg real | 

is a real associative algebra with identity. The elements of Q are known as quater- 
nions and Q is called the algebra of real quaternions. 

Theorem 10: The algebra of real quaternions is an absolute valued algebra. 

Proof: Apply the converse to theorem 5. First, for every q in Q define the 
quaternion conjugate of q by 


q = Q!q • 1 - (a^i + a 2 ) + 


By simple multiplication it can easily be shown that the mapping i//( q) = q defines an 
involution on Q. Similarly, it can be shown that qq=N(q)*l for all q in Q. 
Finally, define T(q)-1 = q + q for all q; then T(q) is clearly real. Since Q is 
associative, N(pq) = N(p)N(q) and the proof is complete. 


This section is concluded with the following proof of the uniqueness of the algebra 
of real quaternions. 


Theorem 11: Let A be a real associative division algebra. If A is not commu- 
tative, then A is isomorphic to the algebra of quaternions. 


Proof: It is first shown that I = C(A). By lemma 4, I 9 C(A). Suppose there 
exists an element a in C(A) such that a is not in I. Then, as previously shown, 


there would exist real numbers a, fi such that 



-1. Thus, C(A) would 


contain a field C* isomorphic to the field of complex numbers. Hence, A would be 
algebraic over C* and therefore by lemma 5, A = C*. This contradicts the assumption 
that A is not commutative. Therefore, I = C(A). 


Let 

such that 
that 


a be any element of A such that a is not in I and take 

O 

i = -1. Then i is not in I so there exists an element b 


i = / a + QLll 

V 0 

in A such 


c = bi - ib ^ 0 


Note that 


9 2 

ic + ci = i(bi - ib) + (bi - ib)i = ibi - i^b + bi - ibi = 0 
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so that ic = -ci. Furthermore, 


ic 2 = (ic)c = -(ci)c = c(ci) = c 2 i 

o 

so that c commutes with i. 

Now c satisfies some quadratic equation over I. Let 

c 2 + yc + 5-1 = 0 (y, 6 real) 


Since 


yc = 


-c 2 


6-1 


then yc commutes with i. Hence 


yci = iyc = yic = -yci 


and 2yci = 0. Since 2ci ^ 0 and A is a division algebra, y = 0. Therefore 
c 2 = -6*1. Also c cannot be in I since ic = -ci. Hence 6>0 so that 5= £ 2 
($ real). 


Let j = then j 2 = -1. Also, 


ji + ij = £!±ic = o 


and, therefore, ij = -ji. Let k= ij. 

Hence A contains the algebra C* + C*-j which is isomorphic to the algebra of 
real quaternions. This algebra is denoted by Q*. 

Finally it is shown that Q* = A. Suppose Q* ^ A. Then for some x in A but 

not in Q , an element l can be determined in A which is not in Q and such that 
2 

l - -1. Now i ±1 are roots of quadratic equations over I. For real a j, a^, &2’ 

let 


(i + l) 2 + t»j(i + l) + atg"! = 0 


and 


(i - l) 2 + /3 1 (i - l ) + j3 2 *1 = 0 
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I 


Hence 


(i + l) 2 


-2-1 + iZ + li = -a!j(i + l) - <*2*1 


and 

(i - l) 2 = -2-1 - il - li = -p x (l - l) - jSg-l 

Adding gives 


(«1 + ^l) 1 + («i ' ^i)* + ( a 2 + “ 4 )’ X = 0 

Since 1, i, l are linearly independent, 


“1 = 


= 0 


and, therefore, 


tt + li = ck - 1 


Similarly, 


and 


ji + 1} = /3* 1 


kf + ik = yl 


Thus 


(a real) 


(A y real) 


Zk = (U)j = (ce-1 - U)j = aj - i(j3*l - ji) = aj - /3i + kf 


Then 


2k l = y-1 + /3i - aj 


Multiplying by k gives 


-2Z = yk + /3j + ai 
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This result implies that l is in Q* and contradicts the assumption that Q ^ A. 
Hence A = Q* and the proof is completed. This theorem is treated in reference 6, 
pages 327-329, and in reference 4, pages 100-112. 

From some of the previous results the following corollary can be stated. 

Corollary: Let A be a real associative absolute valued algebra. Then A is 
isomorphic to the real numbers, to the complex numbers, or to the real quaternions. 

THE ALGEBRA OF REAL QUATERNIONS 

Let Q denote the algebra of real quaternions. 

Theorem 12: Let p be a fixed nonzero quaternion. Then 0(q) = pqp - ^ is an 

automorphism on Q. Furthermore, every automorphism on Q is of this type. 

Proof: Since Q is an associative division algebra, every nonzero element of Q 
has a unique inverse. Consider 


0(q) = pqp" 

for all q in Q and some fixed nonzero element p. If q^, q 2 are arbitrary elements 
in Q and a, 13 are real, then 

0^q 1 + /3q 2 ) = p(aiq 1 + /3q 2 )p _1 = “Pq^" 1 + 0Pq 2 P _1 

and, hence, 


0^q 1 + 0q 2 ) = a-^qj) + 00(q 2 ) 


Also 


0 ( q i q 2) = pqi(p“ 1 p)q 2 p ‘ 1 = (pqiP -1 ) ( pq 2 p _1 ) 


so that 


e( q i q 2 ) = 0 (q!)- 0 (q 2 ) 

Finally, if 0 ^q ^ = 0^q 2 ^, then q^ = q 2 since Q does not have any nonzero divisors 
of zero. Since 8 is an onto mapping, 8 is an inner automorphism on Q. 
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Now suppose O' is an automorphism of Q and let 


0'(q) = q’ 

Suppose that under the mapping O' 


1 — 1 



then 1, e-p eg, e g obey the same rule of multiplication as defined for 1, i, j, k. There 
now exist elements p^, pg, p g in Q such that 

P 1 = e 3^ " e 2 k + e i + i 
Pg = ejk - e 3 i + eg + j 

P 3 = egi - ejj + e 3 + k 

It is now shown that for every q in Q 


q'Pi = Pjq 

q’P 2 = p 2 q 

and 

q’P 3 = P 3 q 

where q' is the image of q under the mapping O' . By using the previously described 
rule of multiplication on Q, 


and 


ejPj = -e 2 j - e g k - 1 + e^i 


Pji = -e g k - egj + eji - 1 
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so that 


Similarly, 


and 


e l p l = Pi 1 
e 2 p l = p l3 

e 3 p l = Pl k 


Hence for every q in Q, 


Q’Pj = P x q 


and, similarly, 


q ,p 2 = P 2 q 


and 


q p 3 = p 3* 

If one of the elements Pp p 3 , p 3 is not zero, the proof of the theorem is complete. 
Now, suppose that 


P X = P 2 = 0 

then 


e l + i = e 2 k - e 3 j 


Also 


p 2 = - e g i + e^j - ik = e^ej - i) + ^ - i)k = 0 
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since 



-k 


then 


From 


( e l ' *) = e 3( e l ' i ) k = e 2 k+ e 3 j 


e^ + i = egk - e^j 


is obtained 


i = -e 3 j 

which implies that e 3 = k. Similarly, if 

p 2 = p 3 = ° 


then 


i = 


e l 


and if 


P 3 = P X = 0 

then 

j = e 2 

Thus, if p^ = P 2 = P 3 = 0, then O' must be the identity mapping and p = 1. This com- 
pletes the proof of the theorem. This theorem is discussed in reference 11, 
pages 257-259. 

Theorem 13: The collection of all automorphisms on Q form a multiplicative 
group of linear orthogonal transformations on Q. 
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Proof: Let G be the collection of all automorphisms on Q. By theorem 12, 
the elements of G are linear transformations of the form T p where p is a fixed 
nonzero quaternion and 


qT p = pqp“ 

for all q in Q. First, note that G is closed under multiplication - that is, suppose 
p-p pg are fixed nonzero elements of Q; then for any q in Q 

“(VS) " ( qT Pl) T P 2 = ( p 2 p l) q ( p l' lp 2' 1 ) 

But 

pfV 1 = piP 2 /n(PiP 2 ) = p 2 Pi/ n (PiP 2 ) = (p 2 Pi ) _1 

and, hence, 





so that G is closed. Similarly, 



for fixed nonzero elements p^, Pg, Pg and q in Q. Thus G is associative. 

Now, Tj is clearly in G. If Tp is any element of G and q is an arbitrary 
element of Q, then 


a( T i' T p) - i T p 

and, therefore, is the identity in G. Finally, since each element of G is a non- 
linear transformation on Q, Tp - * exists for each Tp in G. Hence G is a multi- 
plicative group of linear transformations on Q. 

Note that for all q in Q and each T in G, 

r 

<(qT p , qT p ^= N(pqp _1 ) = N(q) = <q, q> 

Thus G is a group of linear orthogonal transformations on Q. 
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As for theorem 5, note that 


Q = I © l - 1 


where I is isomorphic to the field of real numbers. Furthermore, from the construc- 
tion of Q, I"*" is isomorphic to the real Euclidean vector space of dimension three. 
Denote I- 1 - by Eg. Then every element in Q is of the form 

q = r + v 

where r is in I and v in Eg. 

Theorem 14: Let G be the group of all automorphisms on Q. Then (1) the ele- 
ments of I are invariant under the transformations of G and (2) G defines the group 
of all rotations on Eg. 

Proof: That the elements of G leave the elements of I fixed is clear since 
each r in I is of the form 1, where a is real. Thus, for each r in I and 
T p in G, 

rT p = p(o!-l)p _1 = r 
and, hence, for any q = r + v in Q and Tp in G, 

qT p = r + pvp' 1 

Consider the effect of an element in G on an element of Eg. Let 

p = ckq- 1 + a^i + a-gj + a 3 k 
be any fixed nonzero element of Q, and let 


v = xi + yj + zk 

be an arbitrary element of Eg. Then 


vT 

P 


= pvp 



Expanding this expression shows that vTp is in Eg. Since 


Tp is an orthogonal 
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transformation, Tp defines either a rotation on Eg or a rotation followed by a 
reflection on Eg. Let 


v' = x'i + y'j + z'k 
denote the vector in Eg such that 

vT = v* 

V ip 

From the expansion of vTp results the following matrix representation of this 
transformation: 



0 + a l - a 2 - “ 3 ) 2 { a l a 2 - a 0 ff 3) 2 (“l“3 + O! 0 Q! 2) 


X 


2 { a 2 a l + « 3 <* 0 ) + a 2 - a 2 - a 2 ) 2(o- 2 o- 3 - fl^) 


y 

■ "(Tp) 

2 ( a 3 a l~ a 2 a o) 2 { a 2 a 3 + a 0 a l) ( a 0 + a l - a l ' a l)j 


z 



Let m^Tp^ denote the matrix of the transformation 

A(p) = N(p).m(T p ) 

Since the determinant of m^Tpj is given by 


det [rr 


L 


m ( T p)l * 41 

' _ 


Tp and let 


then 


det A(p) = ±N(p) 3 

Inasmuch as det A(p) is a polynomial in o!q, a^, a^, «g, it is a continuous function 
from 
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to R and is either always positive or always negative. That is, suppose there exist 
fixed nonzero quaternions p^ and Pg such that 


det 


> 0 


and 


Since 


det A 


( p 2) 


< 0 


{r 4 - (0, 0, 0, 0)} 


is connected, it is shown by the intermediate value theorem (ref. 12, p. 322) that there 
exists a Pg in Q such that 

det A(p 3 ) = ±N(P 3 ) 3 = 0 


But this expression implies that p 3 = 0 which is impossible since Pg = 0 is not in the 
domain of det A(p). Therefore, 

det [ m ( T p)] = 1 

Tp in G or 

det [“( T p)] = - 1 

T in G. Now consider T t in G defined by 

r r 

P' = O! • 1 

Then clearly det A(p') = a® = +N(p')^ and is positive for all transformations in G. 
Therefore, 


for all 


for all 


det 



= +1 


for all Tp in G. Hence G is a group of rotations on Eg. 

It is now shown that G is the group of all rotations on Eg. Let R denote any 
rotation on Eg. From analytic geometry R can be defined by the direction cosines of 
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the axis of rotation together with the angle of rotation about that axis. Let £, 77, and £ 
denote the direction cosines of the axis of rotation with the X, Y, and Z axes, respec- 
tively. Also, let a) denote the angle of rotation. Whittaker (ref. 13 , p. 7 ) has shown 
that R has the following matrix representation: 


1 -2(l - | 2 )sin 2 £ 

2 sin £ ^77 sin ~ - ? cos £^j 


2 sin — (££ sin — + 7] cos — 
2 2 2 


2 Sin sin f + ? cos f 

1 - 2(1 - 7] 2 )sin 2 ^ 

2 Sin sin ^ - £ cos f 


This matrix is precisely the matrix obtained when 


<*0 - cos £ 


i. . CO 

a 1 = -k sin — 


a 




2 = -V sin 5- 


and 


2 Sin f sin f “ 77 cos f ) 

2 sin ^(77? sin — + £ cos — ^ 

2 V 2 2 ) 

1 - (l - ? 2 )sin 2 | 


« . CO 

a 3 = sin — 

are substituted in the expansion of vTp = pvp"* previously given. Now cKq, a a g, a 3 
cannot all be zero since 

| 2 + ? 7 2 + = 1 

Thus, a fixed nonzero quaternion is found - namely, 


CO t J..CO. . CO . . CO , 

p = cos — . 1 - £ sin -1 - 77 sin — 3 - ? sin — k 


such that 


vTp = vR 

Hence R is in G and the proof of the theorem is complete. 
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Corollary: The most general rotation of a vector v in Eg can be defined by 


vTp = pvp 


where 


p = cos ^ • 1 - sin |^(£i + m + ?k) 

77, and £ are the direction cosines of the axis of rotation with the X, Y, and 
Z axes, respectively, and a> is the angle of rotation about the axis. 

CHARACTERIZATIONS OF QUATERNIONS 

Some of the properties and characterizations of Q which follow from the theorems 
of this paper are now given. 

(1) Multiplication on Q in Gibbs notation (see ref. 14, pp. 403-429): Let £vj, v^j 
denote the vector cross product of elements in Eg. Then by the rule of multiplication 
defined on Q, it can readily be established that 

(a) v l v 2 = ” < v l’ v 2>* 1 + fl’ v 2] 

Hence, for all = r j + v^ and q 2 = r 2 + v 2’ 

(b) qi q 2 = (r t r 2 - <v lf v 2 >-l) + (r ^ 2 + r 2 v l + [ v l> v 2]) 

The relationships (a) and (b) yield the following interesting identities: 

( c ) v 1 v 2 + v 2 v 1 = “ 2 < v 1 ’ v 2>* 1 

v 1 v 2- v 2 v 1 = 2 [ v U v 2] 
v l ( l2- < l2 v l = 2 [ v U v 2] 

and 

qi q 2 " ^1 = 2 [jp v 2] 
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(2) By theorem 10, i^(q) = q= r- v is an involution on Q for all q = r + v 

in Q, q being defined as the conjugate of q. Thus, 

T(q)-1 = q + q = 2r 


N(q)*l = qq = qq 


exp + /3p = <xp + /3q 


and 


pq = qp 


for all p, q in Q and real ex, [3. 

(3) First regular representation of Q: Let 

q = oTq' 1 + a^i + exg j + 

For some p in Q the linear transformation Rp is defined by 


qRp = qp 

for all q in Q. Then from theorems 8 and 9 

q — o-qRj + a ^ + o-gRj + ofgR,^ 


and 


q — eXpin^R^ + <*]_ m (Ri) + a 2 m ( R j) + a 3 m (' R k) 
As an example, mf R^ is given by 


lRj = i = 0*1 + i + 0*j + 0*k 
iR^ = i^ = -1 + 0*i + 0*j + 0-k 
jRj = ji = 0-1 + 0'i + 0*j - k 


kRj = ki = 0-1 + 0-1 + j + 0'k 
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Thus 



m 


R< 



0 

0 

0 

-1 


1 

0 

0 

0 



and, hence, 
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(4) Second regular representation of Q: In a manner similar to that for the first 
regular representation, 

q ~ + or^ + ffgLj + a 3 L k 

and 

q — “o m (I J 1 ) + a l m ( L i) + a 2 m ( L j) + 0! 3 rn ( L k) 

with Lp being defined by qLp = pq for all q in Q. Thus, 
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and, hence, 


“l a 2 

a 0 “3 

“3 “0 

a 2 -a ± 

(5) Rotations: The Euler angles \f/, 9, <p provide the most widely used technique 
for describing a rotation in Eg. Let 0 xyz denote a right-hand system of rectangular 
axes fixed in space and let v denote any vector in this system. In theorem 14 it was 
shown that any rotation of v can be defined by 

vTp = pvp -1 

where p is some fixed nonzero quaternion 

P = Q! q* 1 + Oiji + Qigj + 

This rotation can also be described by three successive Euler angle rotations. The 
relationship between the quaternion components a q, at^, a 2 , oig and the Euler angles 
if/, 9, cp is now derived. 

Let R , denote a rotation of i// about the Z-axis, which rotates the 0 sys- 
z 

tern into the CL „ _ system, and let 
x l y l z 

vI V,z = V 1 

Then, by the corollary of theorem 14, there is a fixed nonzero quaternion p^^ such that 

vI V,z = vT Pl = Pl^l" 1 = V 1 

where 

p x = cos ^-1 - sin 
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I 



I 


II I I II III II I 


Similarly, let 


such that 


R s,y 1 : °*iyi z ~ °x'yj z j 


v i R «,y 1 = v 2 


Then there exists a in Q such that 


v,R„_ = VlT„ - P2V2' ='2 


1 •,y 1 " i‘j| 


where 


Finally, let 


such that 


P 2 = cos f' 1 - sin | j 


%,x’ : ^’yjZj "*■ ^x'y’z' 


v 2 R 0,x' = v ’ 


Then, there is a pj in Q such 


that 


where 


v 2 R 0,x' = v 2 T p 3 = P3 v 2 p 3 _1 = v ’ 


Po = cos ^-1 - sin i 
3 2 2 


The total rotation of a vector v in 0_„ into v' in 0 . . , is given by 

xyz x y z 

vT = pvp"* = v' 

A - ' 

where p is some fixed nonzero quaternion 

p = (Sq- 1 + a^i + a 2 j + ffgk 
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Since 


v ' = P3P 2 (Pi v Pf 1 )p2' 1 P3“ 1 


then 


P = P 3 P 2 Pi 


and, hence 


p = ^cos ^>1 - sin £ i^cos ^-1 - sin ^ j^cos ^*1 - sin 

Expanding the right-hand side of this expression gives the following relationships between 
the quaternion components and Euler angles: 

a n = cos cos ^ cos ^ + sin ^ sin ^ sin ^ 

0 2 2 2 2 2 2 

a* = cos ^ sin ^ sin ^ - sin ^ cos cos ^ 

1 2 2 2 2 2 2 

a 0 = -cos sin !• cos - sin ^ cos ^ sin ^ 

2 2 2 2 2 2 2 

a, = -cos ^ cos ^ sin ^ + sin ^ sin - cos ^ 

3 2 2 2 2 2 2 

The rotation of v into v' has the following matrix representation in terms of the 
Euler angles (ref. 15): 


x' 


y’ 


cos cos 9 


cos 6 sin i p 


-sin 6 


cos \f/ sin 9 sin cp - cos cp sin \p sin < p sin 9 sin \p + cos i// cos cp sin c p cos 9 

cos cp sin 9 cos ip + sin cp sin ^ cos (p sin 9 sin i// - sin cp cos if/ cos cp cos 9 


Since the Euler transformation is identical to the quaternion transformation, the following 
additional relationships are easily determined: 
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4/ = tan" 


2 ( a l Q! 2 ~ ^0^3) 

( 2 2 2 2 \ 


0 = sin 


-l P 2 ( Q! l Q! 3 + Q! 0 Q! 2) 
N(p) 


cp = tan 


-ll 2 ( Q; 2 Q! 3 ~ a Q a l) 

( 2 2 2 2 \ 

[a Q + a z -a 1 -a 2 ) 


This subject has been discussed in detail in references 15 and 16. 

(6) Rate equations (see also refs. 15 and 16): Suppose that the system ° x ty» z t is 
rotating with an angular velocity co. Let 

= pi + qj + rk 

where p, q, and r are the angular velocities about the X', Y', and Z' axes, 
respectively. The Euler angle rates are expressed as follows (ref. 15): 


£ 


0 sin cp / cos 9 cos <p / cos 9 


P 

e 


0 cos <p -sin cp 


q 

<p 


1 sin cp sin 0/cos 9 cos cp sin 9/ cos 9 


r 


The obvious disadvantage of this system of equations is the singularity existing at 

0 = (2n - 1) | (n = 1, 2, . . .) 

It is now shown that no such problem exists in the corresponding rate equations for 
the quaternion components. Suppose that the quaternion q is a function of the scalar 
quantity t - that is, 


q = q(t) = w(t)-l + x(t)i + y(t)j + z(t)k 
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Then analogous to the definition of a derivative in the Euclidean vector space of dimen- 
sion three, define 


dq(t) _ lim q(t + At) - q(t) 
dt At ~° At 


or 


jr q(t) = jr w(t)*l + £ x(t)i + y(t)j + z(t)k 
dt dt dt dt dt 

From this expression it follows that 

l( q l q 2) = (l q l) q 2 + q l(| q 2) 

Now let Tp be the quaternion transformation rotating the vector v in 0 xyz 
into v T in 0 X , , z , — that is, 

v' = vT = pvp" 1 

r 

where p and v are functions of t. Furthermore, let O x »yt z » be rotating with an 
angular velocity o> as defined previously. Finally, define the quaternion 

X = Xq- 1 + Xji + Xgj + Xgk 

by 


where 

Then 


= J2_ = E_ 


P| + /N(p) 


Xi = W 


(i = 0, 1, 2, 3) 


v' 



= XvX 


and, also, 


v = v'T = p”'*' v’p 

r 


Xv'X 
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From this relationship, the following matrix representation for T is obtained: 

r 


2 . a 2 A 2 A 2 
0 + A 1 A 2 a 3 

2 ( A 1 A 2 " A 0 A 3) 

2 ( A 1 A 3 + A 0 A 2) 

2 ( A 2 A 1 + A 3 A o) 

2 2 2 2 
A 0 + a 2 - A 3 - 

2 ( A 2 A 3 ” A 0 A l) 

2 ( A 3 A 1 - A 2 A o) 

2 ( A 2 A 3 + A 0 A l) 

2 2 2 _ 2 
A 0 + a 3 a 1 a 2 


From theoretical mechanics (ref. 17, pp. 141-146) is found the following 
relationship: 


(&. V = + u v'i 

\dt j dt L ’ J 


where ^ is a vector in the C) „ or fixed system and f — | denotes the vector — 
dt xyz J ^dt J dt 

relative to the moving system 0_. Hence in terms of the quaternion transforma- 

xyz 

tion T , the relationship can be expressed as 

r 


Now, 


Since AA = 1, 


Thus, 


diW = ( —V = + r w , v'l 

dt j P \dt) dt L’ J 


dvV = A W ^ + xf — A"W’ + v'f— x) X 

dt j P [dt 1| dt \dt / \dt / 


^Yr= — -fAx^Av’+v-f^xU 

dt / P dt \dt j Idt 


[„,v]=v(|x)r-(ix)rv. 


Note that for any A in Q 


— A^ A = r* + v* 
dt 


where r* is in I and v* is in Eg. Then, from (c) of property (1) is found 

[o>, v'] = -2 [v* v] 
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or 


1 

! 


[u> + 2v* v] =0 

Since this expression is valid for all v* in Eg and since the vector cross product is 
nondegenerate, 


a} = -2v* 


Finally, 


v 


* 


Again, since XX = 1, 



Thus, 



or 



Expanding this expression gives the following matrix representation: 


V 


— — l 

0 -p -q -r 


X 0 


_ _ 1 

p 0 -r q 


X 1 

*2 

’ 2 

q r 0 -p 


X 2 

*3 


r -q p 0 


X 3 
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Equivalently, 


where 



; = l x . 

i dt ‘ 


Note that the substitution 


A = 


E 

+ m 


is equivalent to using 


vT p = pvp 


-1 


and applying the constraint 


pp = N(p) = 1 


(i = 0, 1, 2, 3) 


Normally, direction cosines are used to avoid the problem of gimbal lock. How- 
ever, this involves the three Euler angle differential equations being replaced by six 
direction cosine differential equations with three algebra equations and three constraints. 
On the other hand, representing rigid-body rotations with quaternions involves only four 
differential equations with one constraint. 


RESUME 

Some characterizations and properties of linear algebras over the field of real 
numbers are as follows: 

1. Every real algebra is a normed algebra and every real absolute valued algebra 
is a division algebra. 

2. Given a real absolute valued algebra without associativity of multiplication, one 
is not assured of the existence of an identity element. However, multiplication can be 
redefined in such a way that the resultant algebra is an absolute valued algebra with 
identity. 
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3. The algebra of real quaternions is a unique associative division algebra which 
is isomorphic and anti -isomorphic to algebras of real n x n matrices. 

4. The collection of all automorphisms on the algebra of real quaternions defines 
the group of all rotations on the real Euclidean vector space of dimension three. The 
advantage of representing rigid-body rotations with quaternions is in the elimination of 
the gimbal lock problem encountered when using Euler angles. 

Langley Research Center, 

National Aeronautics and Space Administration, 

Langley Station, Hampton, Va., September 14, 1966, 

125-23-02-04-23. 
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